Binary Classification on Past Due of Service Accounts using Logistic Regression and Decision Tree

نویسندگان

  • Yan Wang
  • Jennifer L. Priestley
  • Jennifer Lewis Priestley
چکیده

This paper aims at predicting businesses’ past due in service accounts as well as determining the variables that impact the likelihood of repayment. Two binary classification approaches, logistic regression and the decision tree, were conducted and compared. Both approaches have very good performances with respect to the accuracy. However, the decision tree only uses 10 predictors and reaches an accuracy of 96.69% on the validation set while logistic regression includes 14 predictors and reaches an accuracy of 94.58%. Due to the large concern of false negatives in financial industry, the decision tree technique is a better option than logistic regression on the given dataset in terms of its relative lower false negative. Accuracy, false positive and false negative are all very important criteria in model selection and evaluation. Decision making should rely more on the research purpose, rather than on the exact values of these criteria. Keywords—Past Due, Binary Classification, Logistic Regression, Decision Tree

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ranking stocks of listed companies on Tehran stock exchange using a hybrid model of decision tree and logistic regression

Much research has introduced linear or nonlinear models using statistical models and machine learning tools in artificial intelligence to estimate Iran's rate of return. The primary purpose of these methods is simultaneously use different independent variables to improve stock return rates' modeling. However, in predicting the rate of return, in addition to the modeling method, the degree of co...

متن کامل

مقایسه دقت پیش‌بینی رگرسیون لجستیک و درخت رده‌بندی در تعیین عوامل خطر و پیش‌بینی ابتلا به سرطان پستان

Background and Objectives: Breast cancer is one of the most common malignancies in women which accounts for the highest number of deaths after lung cancer. The aim of the current study was to compare the logistic regression and classification tree models in determining the risk factors and prediction of breast cancer. Methods: We used from the data of a case-control study conducted on 303 pa...

متن کامل

Comparing the Results of Logistic Regression Model and Classification and Regression Tree Analysis in Determining Prognostic Factors for Coronary Artery Disease in Mashhad, Iran

Background and purpose: Understanding of the risk factors for cardiovascular artery disease, which is the leading cause of death worldwide, can lead to essential changes in its etiology, prevalence, and treatment. The aim of this study was to compare the results of logistic regression model and Classification and Regression Tree Analysis (CART) in determining the prognostic factors for coronary...

متن کامل

مقایسه مدل درخت تصمیم و رگرسیون لوجستیک در ارزیابی پوکی استخوان

Introduction: Early detection of osteoporosis is a key to preventing of it; but recognition, without the use of appropriate diagnostic methods, due to the complexity of risk factors and gradual bone loss process, is problem. The purpose of this study is to develop and efficiency evaluation a predictive model of osteoporosis using decision tree technique as a diagnostic method based on available...

متن کامل

Predicting The Type of Malaria Using Classification and Regression Decision Trees

Predicting The Type of Malaria Using Classification and Regression Decision Trees Maryam Ashoori1 *, Fatemeh Hamzavi2 1School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran 2School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran Abstract Background: Malaria is an infectious disease infecting 200 - 300 million people annually. Environme...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017